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Abstract The equivalence of two y? forms for linear function fit is proved by mathemat- 
ical calculation. The simplified R-value measurements are quoted to test the conclusion 


quantitatively. 
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1 Introduction 


The covariance matrix is usually used to construct the x? estimator for correlative 
data which is to be minimized to acquire best estimates for parameters interested H. It is 
frequently the case that experimental data are affected by overall systematic error, such 
as the error of luminosity or efficiency in scan experiment. As a typical example without 
lossing generality, let n; be the selected number of event for certain final state at the 
i-th energy point, and e the corresponding efficiency for all measured points. Then the 
number of measured event is calculated as 


Yrs: 
€ 


Since all y; contain the same €, they are correlated and e here could be treated as a 
normalization factor. Under such case, the n x n covariance matrix V for n measurements 


could be constructed as follows: the diagonal elements are given by 
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'This is a fairly good approximation for the efficiency at continuum region, where no resonance exists. 


where o; is the statistic uncertainty of n; and oy is the relative uncertainty of e€, the quan- 
tity yop is the normalization uncertainty due to factor e for variable y;. The correlation 
between data points 7 and j contributes to the off-diagonal matrix element v;j, which is 


depicted by the product of two normalization uncertainties, that is 
= 2 
Vig = YijOFz - 


Explicitly, the convariance matrix has the form? 
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The expression to be minimized is then? 


Gear Vv i (2) 
where 
Yi — ky 
Yo — k2 
n= : 
Yn — Kn 


is the vector of the residuals between experimental observations y; and theoretical expec- 
tation k;. 

Apart from matrix method, an alternative way to handle correlation problem is 
called factor method © #. A normalization factor f is introduced and is to be fitted as 
a free parameters to take the correlations into account 

2 2 
X= ae r ayp z (3) 
where o; and oy have the same meaning as before. 

Actually, we can comprehend the Eq.(3) from an experimental point of view. We 
treat the efficiency € also as a measured variable and introduce a fitting parameter €o as 
its expected value, then we construct a chisquare form for n+1 uncorrelated measurements 
as follows: EEE 

a r t T o 
2 According to the error propagation forumla, we can derive the convariance matrix fair formally, more 


details can be found in Ref. [5]. 
3In this paper, chi-square minimization is adopted to obtain the best estimated value. For experimental 


data minimization, the MINUIT package is used. The detail explanation about chi-square minimization 
technique and MINUIT package can be found in Ref. [2]. 


where de is the efficiency uncertainty, and y} = n;/€9, which can be rewritten as 


Here we define the f = €/é9. At the same time, we change the last term form of Eq. (4), 


(e/eo — €0/€0)* _ (f — 1)? 


(de/eo)? (de eo)? ` 


Exprimentally, €o could be replaced by the measurable quantity €, so ĝe/co ~ d€/e = of. 


viz. 


Under such case, we immediately recover Eq.(3). 

Now there are two x° forms, as expressed in Eqs.(2) and (3), which have been used to 
deal with the correlated data. As a matter of fact, the equivalence between these two x? 
forms was first discussed in detail by D’Agostini in Ref. [5]. At the same time, D’ Agostini 
pointed out the biasness of these two estimators. Nevertheless, the D’Agostini’s study 


was restricted within the two-variate and constant fitting case. It is natural to ask 
1. is the equivalence of two x? forms a general conclusion? 
2. does the biasness of two x? forms exist always? 


As to the first question, the equivalent conclusion has been expanded to the multi-variate 
and constant fitting case | 7]. Recently, effects have been made in order to extend equiv- 
alent conclusion to the non-constant fitting case. As the first step, the linear dependent 
case is considered in this paper, and the equivalence is proved strictly based on mathemat- 
ical calculation. So far as the second question is concerned, the following study indicates 
for linear function fit, the biasness still exist. Fortunately, by virtue of factor method, the 
biasness due to fit can be corrected through the fitted factor. 

Besides the exprimental requirement, statistically, the analytical results are appreci- 
ated which could provide some qualitatively understandings towards more complicated 
problems and some new clues for further exploration. In the following sections, we first 
work out the minimization values of two x? forms, exactly the same forms of which lead 
to equivalent conclusion directly. Then some simplified experiment results are quoted to 


test the equivalence quantitatively. 


2 Equivalence Proof 


Eqs. (2) and (3) are the two x° forms we will study. Where k; linearly depends on i, 
or 
However in actual experiment, the physical attribute is often indicated by a physical 


variable instead of the sequence of experiment. For the i-th experiment at energy Eem = 


zi, we usually use x; instead of 7 to denote such experiment. So more practically, we write 
the linear form of k; as 
kj =a-a,+ 6 . (5) 


With above expression, Eq. (2) reads explicitly 


aS pe ee (6) 


t=1 j=1 


&. 
II 


Here, the symbol (V~');; has been changed into A;; for the convenience of notation. For 


the factor method, chi-square has the comparatively simplified form: 


2 
0; OF 
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In above equations, subscripts M and f indicate the matrix and the factor methods, 


respectively. 


2.1 Expectation and variance from covariance matrix method 
For briefness, we introduce matrix notation 


xu Xu 
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According to the minimization condition 


Og a 
OX —0 
ob : 


we have 


In above equations, we have defined 
Tee = X tidijtj , Te = È tidi = VAG; , 
ij ij ij 
coe = X BAGGY; Ty = D AijYj ; and To = D Aij : 
ij ij ij 


Solving Eq. (8), we obtain 


and covariance of œ and 8 
on o 1 To 
o3 E Der Trz , 


with 


(10) 


By virtue of definitions in Eqs. (9) and (10), and formulas (A. 2) and (A. 3) in appendix, 


we can work out the following results 
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2.2 Expectation and variance from factor method 


For briefness, we introduce matrix notation 


Jada að Oad A D -E 
D B -F 
På PL PX -E -F C 
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we have 


a 0 
Al 6 |=| 0 (14) 
f A 
In above equations, we have defined 
cH l 1 Ui 
ee Bahia, C=Gtd 4G, 
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Solving Eq. (14), and utilizing the definitions in Eq. (15), we obtain 
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By virtue of Eqs. (13) and (18), S - Dr = of - Da, then comparing Eqs. (11) and (12) 
with Eqs. (16) and (17), the exactly same analytical results of corresponding quantities 


of two methods show their equivalence directly. 


2.3 Discussion 


If we denote f;(7) as a function at i-th point of any measurements 7 = 


with uncertainty o;, the weighted average of f;(7) can be defined as 


16 - il 


with 


— “1 2 — “1 
=e or 0; = a i 


Using above definition, we write the optimized quantities of Eqs. (16) and (17) in another 


forms, for example, we have 


and 


S 
It is interesting to notice, from Eqs. (19) and (20), we obtain the following relation 
gef. OF. (21) 


Furthermore, taking advantage of the forms as displayed in Eqs. (19) and (20), we can 
acquire the relation 


A 


f-y=h+a-z. (22) 


At the same time, from Eq. (5), we have 
ke =B+@-a. (23) 
Here subscript denotes the dependence of k on x. So combining with Eq. (22), we get 
ke = f J. (24) 


Comparing with the results in Ref. [7], we find the relations expressed in Eqs. (21) and 
(24) hold for both the constant and the linear fitting cases. 


3 Experimental test 


For the purpose of illustration, the application of the theoretical formulas to a simpli- 
fied R value measurement is discussed in this section. 

R, the ratio of the hadron production cross section via single photon annihilation to 
the lowest order point-like QED utu cross section op = 4707/38, is a fundamental 
quantity in ete” interaction. It is calculated in the naive quark-parton model as R = 
34 a where Qg is the quark electric charge, and the summation runs over all the 
produced flavors. Taking the lowest order QCD correction and the electro-weak effect 
into consideration, R value would be larger than the naive value (10/3), and the corrected 


term is a slowly varying smooth function of center-of-mass (C.M.) energy, in the region 


T 


Table 1: Values for RI. The errors quoted are point-to-point systematic errors. 


Ee R value Error AR | Eem R value Error AR 


(GeV) (GeV) 
5.60 4.08 0.32 6.60 4.50 0.17 
5.70 4.09 0.16 6.65 4.25 0.16 
5.75 4.12 0.20 6.70 4.63 0.15 
5.80 4.13 0.16 6.75 4.38 0.15 
5.85 4.13 0.19 6.80 4.44 0.16 
5.90 4.09 0.14 6.85 4.50 0.13 
5.95 4.17 0.16 6.90 4.41 0.15 
6.00 4.17 0.09 6.95 4.23 0.17 
6.05 4.16 0.18 7.00 4.10 0.12 
6.10 4.04 0.15 7.05 4.31 0.09 
6.15 4.34 0.16 7.10 4.32 0.14 
6.20 4.05 0.08 7.15 4.29 0.11 
6.25 3.96 0.14 7.20 4.27 0.11 
6.30 4.27 0.14 7.25 4.39 0.11 
6.35 4.47 0.17 7.30 4.29 0.11 
6.40 4.31 0.13 7.35 4.33 0.09 
6.45 4.23 0.14 7.40 4.46 0.08 
6.50 4.40 0.15 7.45 4.51 0.14 
6.55 4.66 0.16 7.50 4.18 0.59 


without any resonances, therefore, R could reasonably be depicted by a linear function in 
a good approximation. 
In experiment, many factors should be considered in R value calculation’. As a ped- 


agogical example, a comparatively concise R expression |! is given here 


2 (N — Nog) 
Le(1 +ô) -op ` 


where N is the number of multi-hadronic events detected, Nog is the estimated number 
of background events, L is the integrated luminosity, e(1 + ô) is the acceptance for the 
multi-hadronic events with radiative effect included and (1 + ô) is the radiative correc- 
tion factor due to higher order QED processes up to order a3. Table 1 lists thirty eight 


experiment R-values!!. From a study of data taken at different times at the same C.M. 


“The R value measurement at BESII has been described in Refs. [10, 11], where the detailed calculation 
about experiment R value could be found. 


energy, the estimated systematic point-to-point errors are given as +3%. For the R value 
used here, the systematic uncertainty in the detection efficiency (48%), the luminosity 
measurement (+6%), the event selection procedure (+2%), and the background substrac- 
tion (43%) yielded a common systematic error of +10%, which should be considered as 
the normalization error. Now these thirty eight R-values will be used to test foregoing 
conclusions. For minimization, the MINUIT package, one of useful CERN packages in 
high energy physics "l, is utilized. 
In the x? construction, the following substitutes are adopted 


eS E 


cm ? 


ees 


exp. ? 


yi > R BR, and ayn , 


exp. ? 
then Eqs. (7) and (6) become 
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Xu = D [Reep. — (Ro ale nE m) (V7 ig (Poop. — (Ro EJ nE! | ’ 
aj 
where of is the overall error of normalization factor f, which equals to 10%, the element 
(vij) of matrix V reads 


Vij = Oij ` (AR a + oF: Roe 


exp. exp. * 


The fitting results are summarized in Table 2. At the same time, using Eqs. (16) and (17), 
we can compute the corresponding values theoretically, which are also given in Table 2 
We can see different method leads to the same results up to the significant digits listed 


in the table. In addition, with these values, we can also test the simple relation given in 
Eq. (21). 


Table 2: Experimental fitted and theoretical calculated values of parameters and relevant 


information. 
Parameter | Matrix method Factor method Theoretical calculation 
Ro 2.2895 + 0.3772 2.2895 + 0.3772 2.2895 + 0.3772 
n 0.1241 + 0.0421 0.1241 + 0.0421 0.1241 + 0.0421 
F — 0.7282 + 0.0853 0.7282 + 0.0853 
?/d.o.f 27.18/35 27.18/35 — 


The fitted values relevant to Rọ and 7 show the minimization equivalence of two x? 
forms fairly well. Next we turn to another aspect of two x? forms, that is the fit biasness, 


which has been noticed in previous papers 5 7 8, 


9 


Fig. 1 shows the fitting result, where the solid line represents the best fitted k value. It 
is rather obvious that the fitted line is far below all data points. Merely by matrix x? fit, 
we are at a loss what to do with this deviation. On the contrary, the factor x? fit provides 
us a normalization factor f which just manifests the magnitude of the biasness. In fact, 
what we want to know is the weighted average of experiment data, that is y, which should 
not contain the biasness due to common uncertainty. Eq. (24) gives the relation between 
k and J, from which we can obtain 7 by scaling k with the factor f. The the dashed line 
in Fig. 1 denotes the factor scaled k value or expected 7. We can see the re-scaled line fits 
the experiment data fairly well’. After rescaled, Ro/f = 3.14 which is fairly consistent 
with the naive expectation 10/3 = 3.3. 
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Figure 1: The R value, error bars indicate point-to-point systematic errors. The data 
points taken from Ref. [4]. The solid line represents the best fitted k value, while the 


dashed line the factor scaled k value or expected J. 


4 Summary 


For linear function fitting, two x? forms have been constructed to handle correlated 
data. The equivalence of these two forms has been proved strictly by mathematical calcu- 
lation, and tested quantitatively by a typical simplified R value measurement experiment. 
However, in light of the comparison of two x? forms, we notice that the factorized x? form 


is more transparent, besides the concise expression and comparative easy calculation in 


In a forthcoming paper [12], we will examine the biasness of x} for linear function fit in more detail. 
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minimization, the cause of the biasness is more easily recognized and corrected by the 
scale factor. By contrast, for matrix x? form the biasness is uncontrolled. 

The importance of this work lies in two aspects, first, the equivalence conclusion has 
been extended from constant fitting to linear function fitting; second, the analytical forms 
of the minimized parameters are worked out explicitly, which display some qualitatively 
features about more complex fitting. 

At last, a remark is in order. Comparing with the formulas of the constant fitting, 
the formulas presented in this paper are more complicated and less intuitive, which im- 
plies that further extension of equivalence conclusion might be much difficult, or another 


approach should be found for equivalence study. 


Appendix: Matrix Formulas 


Some necessary matrix formulas are collected in this appendix, as to the knowledge 
of matrix, see Ref. [13]. 


Let’s consider a special square matrix A, 


xı +ô aybotd +--+ abn +ô 
dob, +6 z2+ô --- azbn +6 

Aal m ee , (A. 1) 
dnb, +ô anb2 + 6 sagen fn +Ô 


If its matrix element reads 
Qij = on s (zi = a;b;) + a;b; + ô ; 


then its element of the inverse matrix could be worked out 


-1_ On aibj + ô ô 2 (ai = ak) $ (b; = by) 


with sei r 
2. aibi + aibi 
au e aibi 7 [ae _ on 
In addition, its determinant has also been proved to be 
+6 ajybotd +--+ abn +ô 
aht mtd abt] g ię 7) 
: : . : i=1 
anbi +6 anb2 +ô +--+: y+ 
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In fact, the error matrix given in equation (1) is just a special form of that given in 


equation (A. 1). So with above formulas, it is easy to acquire the corresponding results 


of matrix V. If the element of V reads 
vig = 65307 + Yi iF ; 


then the element of its inverse matrix is 


i= as! Spares ec 
180 GS” Ga? 
with i 
2y 
S=1lt+o07-S 2 
mra 
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